Npu grpo fix2 #54

Closed

addsubmuldiv wants to merge 742 commits into main from npu_grpo_fix2

Conversation

@addsubmuldiv
Collaborator

No description provided.

meichangsu1 and others added 30 commits February 2, 2026 18:56
- Precompute decay and no-decay parameter name lists before optimizer group creation
- Add explicit param_names field to optimizer groups for better debugging and transparency
- Maintain identical functional behavior while improving code readability
Modify TransformersModel to only apply sp_strategy.postprocess_outputs when labels are None, preventing unintended postprocessing during training or evaluation with labels present. This ensures postprocessing is reserved for inference scenarios.
Add conditional loss reduction using sp_strategy when labels are present in inputs. This ensures that the loss calculation accounts for the sp_strategy's specific reduction logic, improving model training consistency and alignment with the strategy's objectives.
- Add comprehensive docstring to `_get_sp_group_from_device_mesh` explaining how SP groups are derived when no explicit "sp" mesh dimension exists
- Include inline comments in backward passes and attention logic to clarify gradient handling and layout transformations
- Improve readability and maintainability of sequence parallel implementation
Expert parallel support for twinkle.TransformersModel
Add server TPS/RPS control and request queueing
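The first commit above describes precomputing decay and no-decay parameter-name lists and attaching an explicit `param_names` field to optimizer groups. A minimal sketch of that shape, assuming the common convention that biases and normalization weights skip weight decay (the heuristic and all names here are illustrative, not the PR's actual code):

```python
def split_param_names(param_names):
    """Partition parameter names into decay and no-decay lists.

    Assumption: names containing 'bias' or 'norm' skip weight decay,
    a common convention that may differ from the PR's actual rule.
    """
    no_decay_keywords = ("bias", "norm")
    decay, no_decay = [], []
    for name in param_names:
        if any(kw in name.lower() for kw in no_decay_keywords):
            no_decay.append(name)
        else:
            decay.append(name)
    return decay, no_decay


names = [
    "layers.0.attn.qkv.weight",
    "layers.0.attn.qkv.bias",
    "layers.0.input_layernorm.weight",
]
decay, no_decay = split_param_names(names)

# Each group carries an explicit param_names field, mirroring the
# "better debugging and transparency" goal the commit message states.
optimizer_groups = [
    {"param_names": decay, "weight_decay": 0.01},
    {"param_names": no_decay, "weight_decay": 0.0},
]
print(decay)     # ['layers.0.attn.qkv.weight']
print(no_decay)  # ['layers.0.attn.qkv.bias', 'layers.0.input_layernorm.weight']
```

Precomputing the lists once, before group creation, keeps the classification logic in one place instead of re-deriving it per group.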
Copilot AI review requested due to automatic review settings February 11, 2026 15:56

Copilot AI left a comment


Pull request overview

Adds a large documentation + examples + CI/linting setup for the Twinkle project (both English and Chinese docs), plus several cookbook scripts to demonstrate local/Ray/remote workflows.

Changes:

  • Introduces full Sphinx doc trees for docs/source (ZH) and docs/source_en (EN), including new component/usage pages and custom autosummary templates.
  • Adds Read the Docs configs and doc build scripts/Make targets.
  • Adds CI workflows (lint, publish, stale issues, GPU/NPU CI) and updates pre-commit configuration; adds multiple cookbook examples for Transformers/Megatron/Ray/remote usage.

Reviewed changes

Copilot reviewed 245 out of 501 changed files in this pull request and generated 9 comments.

File Description
docs/source_en/index.rst Adds EN docs root toctree and navigation
docs/source_en/_templates/sobolengine.rst Adds Sphinx autoclass template override
docs/source_en/_templates/classtemplate.rst Adds Sphinx autoclass template override
docs/source_en/_templates/autosummary/class.rst Adds autosummary class template
docs/source_en/Usage Guide/Server and Client/index.rst Adds EN “Server and Client” section index
docs/source_en/Usage Guide/Server and Client/Overview.md Adds EN overview for server/client architecture
docs/source_en/Usage Guide/ModelScope-Free-Resources.md Adds EN guidance for free ModelScope resources
docs/source_en/Usage Guide/Installation.md Adds EN installation + hardware support page
docs/source_en/Components/Training Middleware/index.rst Adds EN training middleware nav
docs/source_en/Components/Training Middleware/RemoteClass.md Adds EN remote execution decorator docs
docs/source_en/Components/Training Middleware/DeviceMesh-and-DeviceGroup.md Adds EN DeviceMesh/DeviceGroup docs
docs/source_en/Components/Template/index.rst Adds EN template nav
docs/source_en/Components/Template/Template.md Adds EN template component docs
docs/source_en/Components/Task Processor/index.rst Adds EN task processor nav
docs/source_en/Components/Task Processor/InputProcessor.md Adds EN InputProcessor docs
docs/source_en/Components/Sampler/vLLMSampler.md Adds EN vLLM sampler docs
docs/source_en/Components/Sampler/index.rst Adds EN sampler nav
docs/source_en/Components/Sampler/TorchSampler.md Adds EN torch sampler docs
docs/source_en/Components/Sampler/Sampler.md Adds EN sampler interface docs
docs/source_en/Components/Reward/index.rst Adds EN reward nav
docs/source_en/Components/Reward/Reward.md Adds EN reward component docs
docs/source_en/Components/Preprocessor and Filter/index.rst Adds EN preprocessor/filter nav
docs/source_en/Components/Preprocessor and Filter/Preprocessor.md Adds EN preprocessor docs
docs/source_en/Components/Preprocessor and Filter/Filter.md Adds EN filter docs
docs/source_en/Components/Plugin/index.rst Adds EN plugin nav
docs/source_en/Components/Plugin/Plugin.md Adds EN plugin loading/security docs
docs/source_en/Components/Patch/index.rst Adds EN patch nav
docs/source_en/Components/Patch/Patch.md Adds EN patch docs
docs/source_en/Components/Model/index.rst Adds EN model nav
docs/source_en/Components/Model/TransformersModel.md Adds EN TransformersModel docs
docs/source_en/Components/Model/MultiLoraTransformersModel.md Adds EN MultiLoRA Transformers docs
docs/source_en/Components/Model/MultiLoraMegatronModel.md Adds EN MultiLoRA Megatron docs
docs/source_en/Components/Model/MegatronModel.md Adds EN MegatronModel docs
docs/source_en/Components/Metrics/index.rst Adds EN metrics nav
docs/source_en/Components/Metrics/TrainMetric.md Adds EN TrainMetric docs
docs/source_en/Components/Metrics/LossMetric.md Adds EN LossMetric docs
docs/source_en/Components/Metrics/Building-Metrics.md Adds EN metric authoring docs
docs/source_en/Components/Metrics/Accuracy.md Adds EN Accuracy metric docs
docs/source_en/Components/Loss/index.rst Adds EN loss nav
docs/source_en/Components/Loss/CrossEntropy.md Adds EN cross-entropy loss docs
docs/source_en/Components/Loss/Building-Loss.md Adds EN loss authoring docs
docs/source_en/Components/LRScheduler/index.rst Adds EN LR scheduler nav
docs/source_en/Components/LRScheduler/LinearWarmupScheduler.md Adds EN linear warmup scheduler docs
docs/source_en/Components/LRScheduler/CosineWarmupScheduler.md Adds EN cosine warmup scheduler docs
docs/source_en/Components/Kernel/index.rst Adds EN kernel nav
docs/source_en/Components/Dataset/index.rst Adds EN dataset nav
docs/source_en/Components/Dataset/PackingDataset.md Adds EN packing dataset docs
docs/source_en/Components/Dataset/LazyDataset.md Adds EN lazy dataset docs
docs/source_en/Components/Dataset/IterablePackingDataset.md Adds EN iterable packing dataset docs
docs/source_en/Components/Dataset/IterableDataset.md Adds EN iterable dataset docs
docs/source_en/Components/Data Loading/index.rst Adds EN data loading nav
docs/source_en/Components/Data Loading/DataLoader.md Adds EN DataLoader docs
docs/source_en/Components/Data Format/index.rst Adds EN data-format nav
docs/source_en/Components/Data Format/Trajectory.md Adds EN Trajectory docs
docs/source_en/Components/Data Format/Sampling.md Adds EN sampling types/docs
docs/source_en/Components/Data Format/Output.md Adds EN model output type docs
docs/source_en/Components/Data Format/ModelOutput.md Adds EN ModelOutput docs
docs/source_en/Components/Data Format/Message.md Adds EN Message docs
docs/source_en/Components/Data Format/InputFeature.md Adds EN InputFeature docs
docs/source_en/Components/Checkpoint Engine/index.rst Adds EN checkpoint engine nav
docs/source_en/Components/Checkpoint Engine/NCCLCheckpointEngine.md Adds EN NCCL checkpoint engine docs
docs/source_en/Components/Checkpoint Engine/HCCLCheckpointEngine.md Adds EN HCCL checkpoint engine docs
docs/source_en/Components/Checkpoint Engine/CheckpointEngine.md Adds EN checkpoint engine interface docs
docs/source_en/Components/Advantage/index.rst Adds EN advantage nav
docs/source_en/Components/Advantage/RLOOAdvantage.md Adds EN RLOO advantage docs
docs/source_en/Components/Advantage/GRPOAdvantage.md Adds EN GRPO advantage docs
docs/source_en/Components/Advantage/Advantage.md Adds EN advantage interface docs
docs/source_en/.readthedocs.yaml Adds RTD config for EN docs build
docs/source/组件/预处理器和过滤器/index.rst Adds ZH preprocessor/filter nav
docs/source/组件/预处理器和过滤器/Preprocessor.md Adds ZH preprocessor docs
docs/source/组件/预处理器和过滤器/Filter.md Adds ZH filter docs
docs/source/组件/采样器/vLLMSampler.md Adds ZH vLLM sampler docs
docs/source/组件/采样器/index.rst Adds ZH sampler nav
docs/source/组件/采样器/TorchSampler.md Adds ZH torch sampler docs
docs/source/组件/采样器/Sampler.md Adds ZH sampler docs
docs/source/组件/训练中间件/index.rst Adds ZH training middleware nav
docs/source/组件/训练中间件/RemoteClass.md Adds ZH remote execution decorator docs
docs/source/组件/训练中间件/DeviceMesh和DeviceGroup.md Adds ZH DeviceMesh/DeviceGroup docs
docs/source/组件/补丁/index.rst Adds ZH patch nav
docs/source/组件/补丁/Patch.md Adds ZH patch docs
docs/source/组件/组件化/index.rst Adds ZH plugin nav
docs/source/组件/组件化/Plugin.md Adds ZH plugin docs
docs/source/组件/模板/index.rst Adds ZH template nav
docs/source/组件/模板/Template.md Adds ZH template docs
docs/source/组件/模型/index.rst Adds ZH model nav
docs/source/组件/模型/TransformersModel.md Adds ZH TransformersModel docs
docs/source/组件/模型/MultiLoraTransformersModel.md Adds ZH MultiLoRA Transformers docs
docs/source/组件/模型/MultiLoraMegatronModel.md Adds ZH MultiLoRA Megatron docs
docs/source/组件/模型/MegatronModel.md Adds ZH MegatronModel docs
docs/source/组件/检查点引擎/index.rst Adds ZH checkpoint engine nav
docs/source/组件/检查点引擎/NCCLCheckpointEngine.md Adds ZH NCCL checkpoint engine docs
docs/source/组件/检查点引擎/HCCLCheckpointEngine.md Adds ZH HCCL checkpoint engine docs
docs/source/组件/检查点引擎/CheckpointEngine.md Adds ZH checkpoint engine interface docs
docs/source/组件/数据集/index.rst Adds ZH dataset nav
docs/source/组件/数据集/PackingDataset.md Adds ZH packing dataset docs
docs/source/组件/数据集/LazyDataset.md Adds ZH lazy dataset docs
docs/source/组件/数据集/IterablePackingDataset.md Adds ZH iterable packing dataset docs
docs/source/组件/数据集/IterableDataset.md Adds ZH iterable dataset docs
docs/source/组件/数据格式/index.rst Adds ZH data-format nav
docs/source/组件/数据格式/Trajectory.md Adds ZH Trajectory docs
docs/source/组件/数据格式/Sampling.md Adds ZH sampling docs
docs/source/组件/数据格式/Output.md Adds ZH model output docs
docs/source/组件/数据格式/ModelOutput.md Adds ZH ModelOutput docs (but currently inconsistent)
docs/source/组件/数据格式/Message.md Adds ZH Message docs
docs/source/组件/数据格式/InputFeature.md Adds ZH InputFeature docs
docs/source/组件/数据加载/index.rst Adds ZH data loading nav
docs/source/组件/数据加载/DataLoader.md Adds ZH DataLoader docs
docs/source/组件/损失/构建损失.md Adds ZH loss authoring docs
docs/source/组件/损失/index.rst Adds ZH loss nav
docs/source/组件/损失/CrossEntropy.md Adds ZH cross-entropy loss docs
docs/source/组件/指标/构建指标.md Adds ZH metric authoring docs
docs/source/组件/指标/index.rst Adds ZH metrics nav
docs/source/组件/指标/TrainMetric.md Adds ZH TrainMetric docs
docs/source/组件/指标/LossMetric.md Adds ZH LossMetric docs
docs/source/组件/指标/Accuracy.md Adds ZH Accuracy metric docs
docs/source/组件/奖励/index.rst Adds ZH reward nav
docs/source/组件/奖励/Reward.md Adds ZH reward docs
docs/source/组件/内核/index.rst Adds ZH kernel nav
docs/source/组件/优势/index.rst Adds ZH advantage nav
docs/source/组件/优势/RLOOAdvantage.md Adds ZH RLOO advantage docs
docs/source/组件/优势/GRPOAdvantage.md Adds ZH GRPO advantage docs
docs/source/组件/优势/Advantage.md Adds ZH advantage interface docs
docs/source/组件/任务处理器/index.rst Adds ZH task processor nav
docs/source/组件/任务处理器/InputProcessor.md Adds ZH InputProcessor docs
docs/source/组件/LRScheduler/index.rst Adds ZH LR scheduler nav
docs/source/组件/LRScheduler/LinearWarmupScheduler.md Adds ZH linear warmup scheduler docs
docs/source/组件/LRScheduler/CosineWarmupScheduler.md Adds ZH cosine warmup scheduler docs
docs/source/使用指引/魔搭免费资源.md Adds ZH free ModelScope resources doc
docs/source/使用指引/服务端和客户端/index.rst Adds ZH server/client nav
docs/source/使用指引/安装.md Adds ZH installation doc
docs/source/index.rst Adds ZH docs root toctree and navigation
docs/source/_templates/sobolengine.rst Adds ZH Sphinx template override
docs/source/_templates/classtemplate.rst Adds ZH Sphinx template override
docs/source/_templates/autosummary/class.rst Adds ZH autosummary class template
docs/source/.readthedocs.yaml Adds RTD config for ZH docs build
docs/make.bat Adds Windows Sphinx build helper
docs/README.md Adds docs maintenance readme
docs/Makefile Adds docs Makefile
cookbook/transformers/sp_fsdp_dense.sh Adds single-node FSDP+SP example launcher
cookbook/transformers/sp_fsdp_dense.py Adds Transformers native FSDP dense example code
cookbook/transformers/fsdp2_moe.sh Adds Transformers FSDP2 MoE launcher
cookbook/transformers/fsdp2.sh Adds Transformers FSDP2 launcher
cookbook/transformers/ep_fsdp_qwen3_moe.sh Adds EP+FSDP2 MoE launcher
cookbook/transformers/ep_fsdp_qwen3_moe.py Adds EP+FSDP2 MoE example code
cookbook/ray/run.sh Adds ray cookbook runner
cookbook/megatron/tp_moe.sh Adds Megatron MoE launcher
cookbook/megatron/tp.sh Adds Megatron TP launcher
cookbook/megatron/tp.py Adds Megatron example code
cookbook/legacy/single_program_full.py Adds legacy full training example
cookbook/legacy/sft/streaming_dataset.py Adds legacy streaming dataset example
cookbook/legacy/sft/single_program_moe.py Adds legacy MoE training example
cookbook/legacy/sft/single_program_megatron_full.py Adds legacy Megatron full example
cookbook/legacy/sft/single_program_megatron.py Adds legacy Megatron LoRA example
cookbook/legacy/sft/multi_lora.py Adds legacy multi-LoRA example
cookbook/legacy/sft/local_dataset.py Adds legacy local dataset example
cookbook/legacy/sft/full_sft.py Adds legacy full SFT example
cookbook/legacy/sft/ep_fsdp_qwen3_moe.py Adds legacy EP+FSDP MoE example
cookbook/legacy/sampler/sampler_demo.py Adds legacy sampler demo
cookbook/legacy/remote/twinkle/server_config.yaml Adds legacy twinkle server config
cookbook/legacy/remote/twinkle/server.py Adds legacy twinkle server launcher
cookbook/legacy/remote/twinkle/lora.py Adds legacy twinkle client training
cookbook/legacy/remote/tinker/server_config.yaml Adds legacy tinker server config
cookbook/legacy/remote/tinker/server.py Adds legacy tinker server launcher
cookbook/legacy/remote/tinker/ascend/server_config.yaml Adds legacy ascend tinker config
cookbook/legacy/remote/tinker/ascend/server.py Adds legacy ascend server launcher
cookbook/legacy/npu/lora_npu.py Adds legacy NPU LoRA example
cookbook/legacy/components/dataset.py Adds minimal dataset snippet
cookbook/client/twinkle/transformer/server.py Adds client cookbook server launcher (transformers)
cookbook/client/twinkle/transformer/sampler.py Adds client cookbook sampler example
cookbook/client/twinkle/megatron/server_config.yaml Adds client cookbook server config (megatron)
cookbook/client/twinkle/megatron/server.py Adds client cookbook server launcher (megatron)
cookbook/client/tinker/transformer/server_config.yaml Adds client cookbook server config (tinker transformers)
cookbook/client/tinker/transformer/server.py Adds client cookbook server launcher (tinker transformers)
cookbook/client/tinker/transformer/sample.py Adds client cookbook sampling example (tinker)
cookbook/client/tinker/megatron/server.py Adds client cookbook server launcher (tinker megatron)
ROADMAP.md Adds roadmap document
.pre-commit-config_local.yaml Updates local pre-commit excludes/hooks
.pre-commit-config.yaml Updates CI pre-commit versions/excludes/hooks
.github/workflows/publish.yaml Adds release publishing workflow
.github/workflows/lint.yaml Adds pre-commit lint workflow
.github/workflows/close_tale_issue.yaml Adds stale issue auto-close workflow
.github/workflows/citest_npu.yaml Adds NPU CI workflow
.github/workflows/citest.yaml Adds general CI workflow
.github/SECURITY.md Adds security reporting guide
.github/PULL_REQUEST_TEMPLATE.md Adds PR template
.github/ISSUE_TEMPLATE/config.yml Adds issue template configuration
.github/ISSUE_TEMPLATE/3-question-discussion.yml Adds question/discussion issue template
.github/ISSUE_TEMPLATE/2-feature-request.yml Adds feature request issue template
.github/ISSUE_TEMPLATE/1-bug-report.yml Adds bug report issue template
.dev_scripts/dockerci_npu.sh Adds/updates NPU CI runner script
.dev_scripts/dockerci.sh Adds/updates GPU CI runner script
.dev_scripts/ci_container_test.sh Adds/updates CI container test entrypoint
.dev_scripts/build_docs.sh Adds docs build helper script


Comment on lines +1 to +3
.. twinkle documentation file,
You can adapt this file completely to your liking, but it should at least
contain the root `toctree` directive.

Copilot AI Feb 11, 2026


The file starts with leading indentation before the .. directive; Sphinx/reStructuredText directives must start at column 0. Remove the two leading spaces so the initial comment is parsed correctly.

@@ -0,0 +1,9 @@
Server and Client
===============

Copilot AI Feb 11, 2026


The section underline is shorter than the title, which triggers Sphinx warnings ("Title underline too short."). Make the underline length at least the same as Server and Client.

Suggested change
===============
=================

Comment on lines +1 to +3
# 模型输入

twinkle用于表示模型输入的类是`InputFeature`,该类适配于transformers/megatron等模型结构。

Copilot AI Feb 11, 2026


This page is named ModelOutput.md and defines ModelOutput, but the title and description talk about model input (InputFeature). Update the title and text to match ModelOutput (模型输出) to avoid misleading documentation.

Suggested change
# 模型输入
twinkle用于表示模型输入的类是`InputFeature`,该类适配于transformers/megatron等模型结构
# 模型输出
twinkle用于表示模型输出的类是`ModelOutput`,该类适配于transformers/megatron等模型结构的输出

@@ -0,0 +1,27 @@
# Filter

The preprocessor is a script for data ETL. Its role is to convert messy, uncleaned data into standardized, cleaned data. The preprocessing method supported by Twinkle runs on the dataset.map method.

Copilot AI Feb 11, 2026


This is the Filter documentation page, but the opening paragraph describes a preprocessor and references dataset.map. Consider rewriting this paragraph to describe filtering (e.g., returning a boolean, typically via dataset.filter).

Suggested change
The preprocessor is a script for data ETL. Its role is to convert messy, uncleaned data into standardized, cleaned data. The preprocessing method supported by Twinkle runs on the dataset.map method.
A filter is used to select which data samples should be kept. It receives a raw row and returns a boolean indicating whether the row should be included in the dataset, and is typically applied via the `dataset.filter` method.

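The review above describes filter semantics: a callable receives one raw row and returns a boolean, and rows mapping to False are dropped, as `dataset.filter` would do. A small plain-Python illustration of that contract (the row schema and the name `keep_nonempty` are hypothetical, not from the Twinkle codebase):

```python
def keep_nonempty(row: dict) -> bool:
    """Keep rows whose 'messages' field exists and is non-empty."""
    return bool(row.get("messages"))


rows = [
    {"messages": [{"role": "user", "content": "hi"}]},  # kept
    {"messages": []},                                   # dropped: empty
    {},                                                 # dropped: missing
]

# Plain-list equivalent of dataset.filter(keep_nonempty):
kept = [row for row in rows if keep_nonempty(row)]
print(len(kept))  # 1
```

The key distinction from a preprocessor is that a filter never transforms the row; it only decides membership.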
for step, batch in enumerate(dataloader):
model.forward_backward(inputs=batch, adapter_name='default')
if step > 0 and step % 20 == 0:
logger.info(f'Current is step {step // 4} of {len(dataloader)//4}, metric: {model.calculate_metric(is_training=True, adapter_name='default')}')

Copilot AI Feb 11, 2026


This line is a Python syntax error because the outer f-string uses single quotes and the inner adapter_name='default' also uses single quotes. Use different quote types (e.g., outer double quotes) or escape the inner quotes.

Suggested change
logger.info(f'Current is step {step // 4} of {len(dataloader)//4}, metric: {model.calculate_metric(is_training=True, adapter_name='default')}')
logger.info(f"Current is step {step // 4} of {len(dataloader)//4}, metric: {model.calculate_metric(is_training=True, adapter_name='default')}")

Comment on lines +52 to +53
dataset = Dataset(
dataset_meta=DatasetMeta(DATASETS, data_slice=range(500))

Copilot AI Feb 11, 2026


create_dataset accepts data_slice but ignores it and always uses range(500), which makes partial(create_dataset, data_slice=...) ineffective in callers. Use data_slice when constructing DatasetMeta (and keep the existing default behavior when data_slice is None).

Suggested change
dataset = Dataset(
dataset_meta=DatasetMeta(DATASETS, data_slice=range(500))
if data_slice is None:
data_slice = range(500)
dataset = Dataset(
dataset_meta=DatasetMeta(DATASETS, data_slice=data_slice)

- raise: Throw an exception. Generally used for very precise dataset scenarios
- left: Remove tokens on the left to conform to max_length
- right: Remove tokens on the right to conform to max_length
- default_system: If the dataset does not have a system, use the default system

Copilot AI Feb 11, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

default_system appears to be a separate top-level parameter (see the constructor signature), but it is currently formatted as a sub-bullet under truncation_strategy. Move default_system to its own top-level bullet to reflect the API correctly.

Suggested change
  - default_system: If the dataset does not have a system, use the default system
- default_system: If the dataset does not have a system, use the default system

Comment on lines +40 to +43
return ModelOutput(
logits=logits,
loss=loss
)

Copilot AI Feb 11, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ModelOutput is described as a TypedDict elsewhere; TypedDict types are not callable at runtime. Consider changing this example to return a regular dict (e.g., {'logits': logits, 'loss': loss}) to avoid presenting code that would fail if copy-pasted.

Suggested change
return ModelOutput(
logits=logits,
loss=loss
)
return {
"logits": logits,
"loss": loss,
}

@@ -0,0 +1,37 @@
## maintain docs

Copilot AI Feb 11, 2026


Capitalize the heading for consistency with typical Markdown style (e.g., 'Maintain docs').

Suggested change
## maintain docs
## Maintain Docs

@gemini-code-assist
Contributor

Summary of Changes

Hello @addsubmuldiv, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request represents a major overhaul and rebranding of the project, transitioning to a new client-server training framework named 'Twinkle'. The core purpose is to provide a more modular, flexible, and production-ready environment for large language model training, supporting diverse hardware (including NPU) and distributed computing paradigms. The changes significantly enhance the developer experience through improved documentation, comprehensive examples, and streamlined contribution workflows.

Highlights

  • Framework Rebranding and Architecture: The project has been rebranded from 'SWIFT' to 'Twinkle', introducing a new client-server training framework with modular, high-cohesion interfaces supporting torchrun, Ray, and HTTP runtime modes.
  • Comprehensive Documentation: Extensive documentation has been added for both English and Chinese audiences, covering installation, quick start guides, server/client architecture, NPU support, and detailed explanations of various components (Dataset, Model, Sampler, Loss, Metrics, etc.).
  • Expanded Cookbook Examples: A wide array of new cookbook examples are included, demonstrating various training scenarios such as LoRA fine-tuning, GRPO reinforcement learning, and usage with Transformers and Megatron backends, across both native Twinkle and Tinker-compatible client APIs.
  • NPU Support Integration: Dedicated scripts and documentation for Huawei Ascend NPU environments have been added, including specific CI container tests and detailed guides on installation, parallelization strategies, and feature support status.
  • Developer Workflow Enhancements: New GitHub issue and pull request templates have been introduced to streamline contributions, alongside updated pre-commit hooks for code quality and a .github/copilot-instructions.md for AI agent guidance.


Changelog
  • .gitignore
    • Updated ignored files to include new lock directories and test-related files.
  • .pre-commit-config.yaml
    • Updated pre-commit hook versions and modified exclusion patterns.
  • .pre-commit-config_local.yaml
    • Updated local pre-commit hook exclusion patterns.
  • CONTRIBUTING.md
    • Updated contributor guidelines to reflect the new project name and contribution focus.
  • CONTRIBUTING_CN.md
    • Updated Chinese contributor guidelines to reflect the new project name and contribution focus.
  • README.md
    • Rewrote the main README to introduce the Twinkle framework and its features.
Ignored Files
  • Ignored by pattern: .github/workflows/** (5)
    • .github/workflows/citest.yaml
    • .github/workflows/citest_npu.yaml
    • .github/workflows/close_tale_issue.yaml
    • .github/workflows/lint.yaml
    • .github/workflows/publish.yaml
Activity
  • The pull request introduces a significant amount of new code, particularly in .dev_scripts/, .github/, client_tools/, cookbook/, and docs/.
  • The changes suggest a foundational shift in the project's structure and focus, moving towards a more robust and documented client-server training framework.
  • The author addsubmuldiv has initiated a major update, likely a new version release or a significant architectural change.
  • No human activity (comments, reviews) is provided in the context.


@gemini-code-assist bot left a comment


Code Review

This pull request introduces a significant number of new files, including CI scripts, documentation, and example usage for the 'twinkle' project. The changes seem to focus on setting up the repository structure, CI/CD pipelines for both GPU and NPU environments, and providing comprehensive examples and documentation.

My review has identified several areas for improvement:

  • CI Scripts: There's duplicated code in dockerci.sh and a potential security vulnerability with the use of eval in dockerci_npu.sh.
  • Repository Configuration: The pre-commit-config.yaml files have been updated, but the list of hooks for pre-commit-hooks has been accidentally removed, which will disable important static checks.
  • Documentation: The new README.md files contain broken links that need to be fixed.
  • Example Code: Some example scripts are either incomplete or use practices like globally disabling torch.dynamo which could be reviewed for performance implications.

Overall, this is a substantial and valuable contribution to setting up the project. Addressing the identified issues will improve the maintainability, security, and correctness of the repository.

Comment on lines +36 to +86
if [ "$MODELSCOPE_SDK_DEBUG" == "True" ]; then
echo 'debugging'
docker run --rm --name $CONTAINER_NAME --shm-size=16gb \
--cpuset-cpus=${cpu_sets_arr[$idx]} \
--gpus='"'"device=$gpu"'"' \
-v $CODE_DIR:$CODE_DIR_IN_CONTAINER \
-v $MODELSCOPE_CACHE:$MODELSCOPE_CACHE_DIR_IN_CONTAINER \
-v $MODELSCOPE_HOME_CACHE/$idx:/root \
-v /home/admin/pre-commit:/home/admin/pre-commit \
-e CI_TEST=True \
-e TEST_LEVEL=$TEST_LEVEL \
-e MODELSCOPE_CACHE=$MODELSCOPE_CACHE_DIR_IN_CONTAINER \
-e MODELSCOPE_DOMAIN=$MODELSCOPE_DOMAIN \
-e MODELSCOPE_SDK_DEBUG=True \
-e HUB_DATASET_ENDPOINT=$HUB_DATASET_ENDPOINT \
-e TEST_ACCESS_TOKEN_CITEST=$TEST_ACCESS_TOKEN_CITEST \
-e TEST_ACCESS_TOKEN_SDKDEV=$TEST_ACCESS_TOKEN_SDKDEV \
-e TEST_LEVEL=$TEST_LEVEL \
-e MODELSCOPE_ENVIRONMENT='ci' \
-e TEST_UPLOAD_MS_TOKEN=$TEST_UPLOAD_MS_TOKEN \
-e MODEL_TAG_URL=$MODEL_TAG_URL \
-e MODELSCOPE_API_TOKEN=$MODELSCOPE_API_TOKEN \
-e PR_CHANGED_FILES=$PR_CHANGED_FILES \
--workdir=$CODE_DIR_IN_CONTAINER \
${IMAGE_NAME}:${IMAGE_VERSION} \
$CI_COMMAND
else
docker run --rm --name $CONTAINER_NAME --shm-size=16gb \
--cpuset-cpus=${cpu_sets_arr[$idx]} \
--gpus='"'"device=$gpu"'"' \
-v $CODE_DIR:$CODE_DIR_IN_CONTAINER \
-v $MODELSCOPE_CACHE:$MODELSCOPE_CACHE_DIR_IN_CONTAINER \
-v $MODELSCOPE_HOME_CACHE/$idx:/root \
-v /home/admin/pre-commit:/home/admin/pre-commit \
-e CI_TEST=True \
-e TEST_LEVEL=$TEST_LEVEL \
-e MODELSCOPE_CACHE=$MODELSCOPE_CACHE_DIR_IN_CONTAINER \
-e MODELSCOPE_DOMAIN=$MODELSCOPE_DOMAIN \
-e HUB_DATASET_ENDPOINT=$HUB_DATASET_ENDPOINT \
-e TEST_ACCESS_TOKEN_CITEST=$TEST_ACCESS_TOKEN_CITEST \
-e TEST_ACCESS_TOKEN_SDKDEV=$TEST_ACCESS_TOKEN_SDKDEV \
-e TEST_LEVEL=$TEST_LEVEL \
-e MODELSCOPE_ENVIRONMENT='ci' \
-e TEST_UPLOAD_MS_TOKEN=$TEST_UPLOAD_MS_TOKEN \
-e MODEL_TAG_URL=$MODEL_TAG_URL \
-e MODELSCOPE_API_TOKEN=$MODELSCOPE_API_TOKEN \
-e PR_CHANGED_FILES=$PR_CHANGED_FILES \
--workdir=$CODE_DIR_IN_CONTAINER \
${IMAGE_NAME}:${IMAGE_VERSION} \
$CI_COMMAND
fi


high

There is a large amount of duplicated code between the if and else blocks for running the docker container. This makes the script hard to read and maintain. Any changes to the docker run command will need to be applied in two places, which is error-prone.

You can refactor this by storing the common docker arguments in an array and conditionally adding the debug-specific arguments. This also allows you to remove the duplicated -e TEST_LEVEL=$TEST_LEVEL flag.

  docker_args=(
    --rm --name "$CONTAINER_NAME" --shm-size=16gb
    --cpuset-cpus="${cpu_sets_arr[$idx]}"
    --gpus='"'"device=$gpu"'"'
    -v "$CODE_DIR:$CODE_DIR_IN_CONTAINER"
    -v "$MODELSCOPE_CACHE:$MODELSCOPE_CACHE_DIR_IN_CONTAINER"
    -v "$MODELSCOPE_HOME_CACHE/$idx:/root"
    -v /home/admin/pre-commit:/home/admin/pre-commit
    -e CI_TEST=True
    -e TEST_LEVEL="$TEST_LEVEL"
    -e MODELSCOPE_CACHE="$MODELSCOPE_CACHE_DIR_IN_CONTAINER"
    -e MODELSCOPE_DOMAIN="$MODELSCOPE_DOMAIN"
    -e HUB_DATASET_ENDPOINT="$HUB_DATASET_ENDPOINT"
    -e TEST_ACCESS_TOKEN_CITEST="$TEST_ACCESS_TOKEN_CITEST"
    -e TEST_ACCESS_TOKEN_SDKDEV="$TEST_ACCESS_TOKEN_SDKDEV"
    -e MODELSCOPE_ENVIRONMENT='ci'
    -e TEST_UPLOAD_MS_TOKEN="$TEST_UPLOAD_MS_TOKEN"
    -e MODEL_TAG_URL="$MODEL_TAG_URL"
    -e MODELSCOPE_API_TOKEN="$MODELSCOPE_API_TOKEN"
    -e PR_CHANGED_FILES="$PR_CHANGED_FILES"
    --workdir="$CODE_DIR_IN_CONTAINER"
  )

  if [ "$MODELSCOPE_SDK_DEBUG" == "True" ]; then
    echo 'debugging'
    docker_args+=(-e MODELSCOPE_SDK_DEBUG=True)
  fi

  docker run "${docker_args[@]}" "${IMAGE_NAME}:${IMAGE_VERSION}" $CI_COMMAND


# Switch to the code directory and run the command

cd $CODE_DIR
eval $CI_COMMAND
Contributor (severity: high):

Using eval with a variable that can be set from an environment variable (CI_COMMAND) is a significant security risk. It can allow arbitrary command execution if the CI_COMMAND environment variable is manipulated. It's strongly recommended to avoid eval here. If the command structure is known, you can parse it and execute it more safely, for example by using an array.
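As an illustrative sketch (the `ci_command` contents here are hypothetical), the command can be kept as a bash array and expanded directly, so its elements are never re-parsed by the shell:

```shell
# Hypothetical CI command stored as an array: one element per argument.
# Unlike `eval "$CI_COMMAND"`, metacharacters inside elements stay literal.
ci_command=(echo "tests passed; echo INJECTED")

# "${ci_command[@]}" expands each element as a separate argument,
# executing the command without an extra round of shell parsing.
result="$("${ci_command[@]}")"
echo "$result"
```

With `eval`, the embedded `;` would have started a second command; with array expansion it is printed verbatim as data.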

- repo: https://github.com/pre-commit/pre-commit-hooks.git
  rev: v6.0.0
  hooks:
Contributor (severity: high):

The hooks section for pre-commit-hooks is empty. This means none of the useful checks from this repository (like trailing-whitespace, check-yaml, end-of-file-fixer, etc.) will be run. This seems unintentional. Please add back the desired hooks.

    hooks:
      - id: trailing-whitespace
      - id: end-of-file-fixer
      - id: check-yaml
      - id: check-added-large-files
      - id: check-merge-conflict

)$
- repo: /home/admin/pre-commit/pre-commit-hooks
  rev: v3.1.0
  hooks:
Contributor (severity: high):

The hooks section for pre-commit-hooks is empty. This means none of the useful checks from this repository will be run. This seems unintentional. Please add back the desired hooks.

    hooks:
      - id: trailing-whitespace
      - id: end-of-file-fixer
      - id: check-yaml
      - id: check-added-large-files
      - id: check-merge-conflict

## Supported Models
We will add support for more models as they are released. The following table lists the models currently
supported by the Twinkle✨ framework. However, the models supported on our serverless training backend may be a
much smaller subset. Please refer to the [doc](link) section for more information.
Contributor (severity: medium):

This link to the documentation is broken. It currently points to (link). Please update it to point to the correct location.


## Changelog

- 🎉 2026-02-10: The first version of twinkle-kit is complete, covering SFT/PT/RL for plain-text models and remote training, with support for [ModelScope's official free resources]()
Contributor (severity: medium):

This link is empty; it currently points to (). Please update it to the correct link for ModelScope's official free resources.



def train():
    raise NotImplementedError("Not implemented")
Contributor (severity: medium):

The train function is implemented to always raise a NotImplementedError, which makes the script non-functional as it's called from the if __name__ == '__main__' block. Was this intentional? If this script is meant to be runnable, you should implement the train function.

from twinkle.model import MultiLoraMegatronModel, MegatronModel
from twinkle.preprocessor import SelfCognitionProcessor
import torch
torch._dynamo.disable()
Contributor (severity: medium):

Disabling TorchDynamo globally with torch._dynamo.disable() might be necessary for compatibility with Megatron-LM, but it prevents potential performance optimizations from torch.compile. Is this disable call strictly necessary for this script to run? If only certain parts are incompatible, consider using torch._dynamo.disable as a context manager or function decorator to scope it more narrowly.

import os
os.environ["CUDA_DEVICE_MAX_CONNECTIONS"] = "1"
import torch
torch._dynamo.disable()
Contributor (severity: medium):

Disabling TorchDynamo globally with torch._dynamo.disable() might be necessary for compatibility with Megatron-LM, but it prevents potential performance optimizations from torch.compile. Is this disable call strictly necessary for this script to run? If only certain parts are incompatible, consider using torch._dynamo.disable as a context manager or function decorator to scope it more narrowly.

from twinkle.model import MultiLoraMegatronModel, MegatronModel, TransformersModel
from twinkle.preprocessor import SelfCognitionProcessor
import torch
torch._dynamo.disable()
Contributor (severity: medium):

Disabling TorchDynamo globally with torch._dynamo.disable() might be necessary for compatibility with Megatron-LM, but it prevents potential performance optimizations from torch.compile. Is this disable call strictly necessary for this script to run? If only certain parts are incompatible, consider using torch._dynamo.disable as a context manager or function decorator to scope it more narrowly.

9 participants